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Abstract. In this paper we formulate and prove a statistical version of the 
Candes-Tao restricted isometry property (SRIP for short) which holds in gen- 
eral for any incoherent dictionary which is a disjoint union of orthonormal 
bases. In addition, we prove that, under appropriate normalization, the eigen- 
values of the associated Gram matrix fluctuate around A = 1 according to the 
Wigner semicircle distribution. The result is then applied to various dictio- 
naries that arise naturally in the setting of finite harmonic analysis, giving, in 
particular, a better understanding on a remark of Applebaum-Howard-Searle- 
Calderbank concerning RIP for the Heisenberg dictionary of chirp like func- 
tions. 



0. Introduction 

Digital signals, or simply signals, can be thought of as complex valued functions 
on the finite field F p , where p is a prime number. The space of signals H = C (F p ) 
is a Hilbert space of dimension p, with the inner product given by the standard 
formula 

(/,<?>= £ f{t)W)- 

tew p 

A dictionary 2) is simply a set of vectors (also called atoms) in TL. The number 
of vectors in D can exceed the dimension of the Hilbert space H., in fact, the most 
interesting situation is when \D\ p = dim7Y. In this set-up we define a resolution 
of the Hilbert space H via D, which is the morphism of vector spaces 

9 : C(£) -► H, 

given by (/) = E v es / (v) V> f° r every / € C (D). A more concrete way to think 
of the morphism is as a p x \D\ matrix with the columns being the atoms in ®. 

In the last two decades [T3] , and in particular in recent years [7J [HI M HH] , 
resolutions of Hilbert spaces became an important tool in signal processing, in 
particular in the emerging theories of sparsity and compressive sensing. 

0.1. The statistical restricted isometry property. A useful property of a res- 
olution is the restricted isometry property (RIP for short) defined by Candes-Tao 
in [}J]. Fix a natural number n G N and a pair of positive real numbers Si, 62 £ M>o. 

Definition 0.1. A dictionary D satisfies the restricted isometry property with co- 
efficients (Si,S2,n) if for every subset S G D such that \S\ < n we have 

(1 - s 2 ) 11/11 < ||e (/)||< (1 + 50II/H, 
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for every function f £ C (53) which is supported on the set S . 

Equivalently, RIP can be formulated in terms of the spectral radius of the cor- 
responding Gram operator. Let G (S) denote the composition Q* s o Q s with 65 
denoting the restriction of to the subspace Cs (23) C C (23) of functions sup- 
ported on the set S. The dictionary 23 satisfies (Si, S2,n)-RIP if for every subset 
S C D such that \S\ < n we have 

5 2 < \\G(S)-Id s \\<S 1 , 

where Ids is the identity operator on (23). 

It is known [U [TUj that the RIP holds for random dictionaries. However, one 
would like to address the following problem [21 [111 [III HIl I2H 123 123 UHl US] : 

Problem 0.2. Find deterministic construction of a dictionary 23 with |23| 3> p 
which satisfies RIP with coefficients in the critical regime 

(0.1) Si, 62 <C 1 and n — a ■ p, 

for some constant < a < 1. 

0.2. Incoherent dictionaries. Fix a positive real number fj, £ M. >0 . The following 
notion was introduced in [HHH] and was used to study similar problems in [2"5ll2T)] : 

Definition 0.3. A dictionary 23 is called incoherent with coherence coefficient \x 
(also called fx-coherent) if for every pair of distinct atoms tp, 4> £ 23 

\M)\<A=- 
VP 

In this paper we will explore a general relation between RIP and incoherence. 
Our motivation comes from three examples of incoherent dictionaries which arise 
naturally in the setting of finite harmonic analysis (for the sake of completeness we 
review the construction of these examples in Section [5]) : 

• The first example [HI [20], referred to as the Heisenberg dictionary ®h, 
is constructed using the Heisenberg representation of the finite Heisenberg 
group H (F p ). The Heisenberg dictionary is of size approximately p 2 and 
its coherence coefficient is /i = 1. 

• The second example [16, 18 , which is referred to as the oscillator dictionary 
23o, is constructed using the Weil representation of the finite symplectic 
group SL2 (F p ). The oscillator dictionary is of size approximately p 3 and 
its coherence coefficient is /J, = 4. 

• The third example [16[ I18j , referred to as the extended oscillator dictio- 
nary Deo j is constructed using the Heisenberg- Weil representation of the 
finite Jacobi group J (F p ) = SL2 (F p ) tx H (¥ p ). The extended oscillator 
dictionary is of size approximately p 5 and its coherence coefficient is /1 = 4. 

The three examples of dictionaries we just described constitute reasonable can- 
didates for solving Problem 10.21 They are large in the sense that |23| 3> p, and 
empirical evidences suggest (see [2] for the case of 23 h) that they might satisfy RIP 
with coefficients in the critical regime (|0.1[) . We summarize this as follows: 

Question: Do the dictionaries 23jj,23o and 23^o satisfy the RIP with coef- 
ficients Si, 82 *C 1 and n — a ■ p, for some < a < 1? 



STATISTICAL RESTRICTED ISOMETRY PROPERTY AND SEMICIRCLE DISTRIBUTION 3 



0.3. Main results. In this paper we formulate a relaxed statistical version of RIP, 
called statistical isometry property (SPJP for short) and we prove that it holds for 
any incoherent dictionary T> which is, in addition, a disjoint union of orthonormal 
bases: 

(0.2) £ = J] B x , 

where B x = {b\, .., b^} is an orthonormal basis of H, for every x £ X. 

0.3.1. The statistical restricted isometry property. Let D be an incoherent dictio- 
nary of the form (|0.2p . Roughly, the statement is that for S C £>, \S\ = n 
with n = p 1_e , for < e < 1, chosen uniformly at random, the operator norm 
||G (S) — Ids\\ 1S small with high probability. 

Theorem 0.4 (SRIP property). For every k e N, there exists a constant C (k) 
such that the probability 

(0.3) Pr (\\G(S) - Id s \\ > P~ E/2 ) < C (k) p 1 ^ 2 . 

The above theorem, in particular, implies that the probability Pr (||G (S) — Id,s\\ >p~ E ^ 2 ) 

as p — + oo faster then p~ l for any / G N. 

0.3.2. The statistics of the eigenvalues. A natural thing to know is how the eigen- 
values of the Gram operator G (S) fluctuate around 1. In this regard, we study the 
spectral statistics of the normalized error term 

E(S) = (P/") 1/2 (G(S)-Ids). 

Let Pe(5) = n 1 S"=i denote the spectral distribution of E (S) where Xi, 

1 = l,..,n, are the real eigenvalues of the Hermitian operator E(S*). We prove 
that p E converges in probability as p — * oo to the Wigner semicircle distribution 
Psc ( x ) = (2 ?r ) V4 — x 2 ■ l[2,-2] {%) where l[2,-2] is the characteristic function of 
the interval [—2,2]. 

Theorem 0.5 (Semicircle distribution). We have 

Pr 

(0.4) lim p E = p sc . 

p — >oo 

Remark 0.6. A limit of the form is familiar in random matrix theory as the 

asymptotic of the spectral distribution of Wigner matrices. Interestingly, the same 
asymptotic distribution appears in our situation, albeit, the probability spaces are of 
a different nature (our probability spaces are, in particular, much smaller). 

In particular, Theorems 10.41 ltT5l can be applied to the three examples T>h, £>o 
and T>EOi which are all of the appropriate form (|0.2[) . Finally, our result gives new 
information on a remark of Applebaum-Howard-Searle-Calderbank [2] concerning 
RIP of the Heisenberg dictionary. 

Remark 0.7. For practical applications, it might be important to compute ex- 
plicitly the constants C (fc) which appears in W. 3\) . This constant depends on the 
incoherence coefficient fi, therefore, for a fixed p, having /i as small as possible is 
preferable. 
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0.3.3. Structure of the paper. The paper consists of four sections except of the 
introduction . 

In Section [TJ we develop the statistical theory of systems of incoherent orthonor- 
mal bases. We begin by specifying the basic set-up. Then we proceed to formulate 
and prove the main Theorems of this paper - Theorem 11-21 Theorem 11.31 and The- 
orem 11.41 The main technical statement underlying the proofs is formulated in 
Theorem 11.51 In Section [2j we prove Theorem 11.51 In Section [3j we review the 
constructions of the dictionaries Dh, ®o and Dbo- Finally, in Appendix lAl we 
prove all technical statements which appear in the body of the paper. 

Acknowledgement 0.8. It is a pleasure to thank our teacher J. Bernstein for his 
continuos support. We are grateful to N. Sochen for many stimulating discussions. 
We thank F. Bruckstein, R. Calderbank, M. Elad, Y. Eldar, R. Kimmel, and A. 
Sahai for sharing with us some of their thoughts about signal processing. We are 
grateful to R. Howe, A. Man, M. Revzen and Y. Zak for explaining us the notion 
of mutually unbiased bases. 

1. THE STATISTICAL THEORY OF INCOHERENT BASES 

1.1. Standard Terminology. 

1.1.1. Terminology from asymptotic analysis. Let {a p } , {b p } be a pair of sequences 
of positive real numbers. We write a p = O (b p ) if there exists C > and P 6 N 
such that a p < C ■ b p for every p > Pq. We write a p = o (b p ) if linip^oo a p /b p = 0. 
Finally, we write a p ~ b p if linip^oo a p /b p = 1. 

1.1.2. Terminology from set theory. Let n G N>i. We denote by [l,n] the set 
{1, 2, .., n}. Given a finite set A, we denote by |^4| the number of elements in A. 

1.2. Basic set-up. 

1.2.1. Incoherent orthonormal bases. Let {(T~l P , (—, —) p )} be a sequence of Hilbert 
spaces such that dim H p = p. 

Definition 1.1. Two (sequences of) orthonormal bases B p ,B' p of TL p are called 
^.-coherent if 

KM'>I<-^, 

for every b € B p and b' € B' p and /j, is some fixed ( does not depend on p) positive 
real number. 

Fix /i S M >0 . Let {X p } be a sequence of sets such that lim^oo \X P \ = oo (usually 
we will have that p = o(\X p \)) such that each X P parametrizes orthonormal bases 
of Ti. p which are /z-coherent pairwisely., that is, for every x € X p , there is an 
orthonormal basis B x = {b\., .., bP.} of H p so that 

JL 

VP' 



(i-i) |M>|< 



for every x ^ y e X p . Denote 

S P = U B x . 

The set D p will be referred to as incoherent dictionary or sometime more pre- 
cisely as /x-coherent dictionary. 
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1.2.2. Resolutions of Hilbert spaces. Let O p : C ('Dp) — > Ti. P be the morphism of 
vector spaces given by 

©p(/)= £ f(b)b. 



The map Q p will be referred to as resolution of Tip via D p . 
Convention: For the sake of clarity we will usually omit the subscript p from 
the notations. 



1.3. Statistical restricted isometry property (SRIP). The main statement 
of this paper concerns a formulation of a statistical restricted isometry property 
(SRIP for short) of the resolution maps O. 

Let n = n(p) — p 1 ^ 6 , for some < e < 1. Let il n = ([1, n}) denote the set of 
injective maps 

0„ = {S : [l,n] 35}. 

We consider the set VL n as a probability space equipped with the uniform prob- 
ability measure. 

Given a map S £ f2„, it induces a morphism of vector spaces S : C([l,n]) — > 
C (35) given by 5* (Si) = Sgu)- Let us denote by <ds '■ C ([1, n}) — ► H the composition 
60S and by G (S) £ Mat nx « (C) the Hermitian matrix 

G(S) = e* s oQ s . 

Concretely, G (S) is the matrix (gij) where gij = (S (i) , S (j)). In plain language, 
G (S) is the Gram matrix associated with the ordered set of vectors (S (1) , S (n)) 
in H. 

We consider G : f2„ — > Mat nx „ (C) as a matrix valued random variable on the 
probability space f2„ . The following theorem asserts that with high probability the 
matrix G is close to the unit matrix /„ £ Mat nX n (C). 

Theorem 1.2. Let < e <C 1 and let k £ N be an even number such that efc> 1 

Pr (||G - I n \\ > (n/p) 1/(2+e) ) - O ((n/pf k/[2+e) n) . 

For a proof, see Subsection 11.61 

In the above theorem, substituting n = p 1 " 6 yields 

Pr (||G - I n \\ > p- £ /( 2 + e )) = O L-ee(k+l)/(2+e)+l^ 

Equivalently, Theorem 1 1.21 can be formulated as a statistical restricted isometry 
property of the resolution morphism 0. 

A given S £ ft n defines a morphism of vector spaces ®s = Bo5 : C([l,n]) — * Ti 
- in this respect, O can be considered as a random variable 

9 : Q„ -> Mor(C([l,n]),W) . 

Theorem 1.3 (SRIP property). Let < e <C 1 and let k £ N be an even number 
such that ek 3> 1 

Pr(Sup{|||e(/)|| - ll/IH} > (n/p) 1/{2+e) ) =0[(n/pT k/{2+e) n) . 
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1.4. Statistics of the error term. Let E denote the normalized error term 

E = (p/n) 1/2 (G-/ n ). 

Our goal is describe the statistics of the random variable E. Let p E denote the 
spectral distribution of E, namely 

1 n 

PE = - E ^.(E), 

n 8=1 

where Ai (E) > A2 (E) > ... > A„ (E) are the eigen values of E indexed in de- 
creasing order (We note that the eigenvalues of E are real since it is an Hcrmitian 
matrix). The following theorem asserts that the spectral distribution p E converges 
in probability to the Wigner semicircle distribution 

(1-2) Psc (£) = ^4-^-1 [2 ,- 2] (x). 



Theorem 1.4. 



lim p E = p sc . 

p— >oo 



For a proof, see Subsection 11.71 

1.5. The method of moments. The proofs of Theorems 11.21 [l~4l will be based 
on the method of moments. 

Let rrifc denote the kth moment of the distribution p E , that is 

m fe = / x k PlL (x) = -J2 Xi (E) fc = -Tr (E fc ) . 
J n n 

Similarly, let msc,k denote the A:th moment of the semicircle distribution. 
Theorem 1.5. For every k e N, 

(1.3) lim E (m fc ) = m S c,k- 

p — >oo 

In addition, 

(1.4) Var(m k ) =0(n~ 1 ) , 
For a proof, see Section [5J 

1.6. Proof of Theorem 11.21 Theorem 11.21 follows from Theorem 11.51 using the 
Markov inequality. 

Let 5 > and k £ N an even number. First, observe that the condition 

1/2 

|| G- — I n \\ > S is equivalent to the condition j|E|| > (p/n) '5 which, in turns, 

1 /2 

is equivalent to the spectral condition A max (E) > (p/n) ' 8. 

Since, A max (E) fc < X 1 (E) fe + A 2 (E) fe + .. + A„ (E) fe we can write 

Pr (A max (E) > (p/n) 112 s) = Pr (A max (E) fc > (p/nf 2 8 k ) 

< ??(Y:U^(V) k >(pln) k/2 5 k 

= Pr (m fc > n- 1 (p/n) k/2 5 k 
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By the triangle inequality mfe < |rrife — Em k \ + Em k (recall that k is even, hence 
mfc > 0) therefore we can write 

Pr (m k > n- 1 (p/n) k/2 5 k ^j < Pr (\m k - Em k \ > rT 1 (p/n) k/2 S k - Em, 

By Em k = O (1), in addition, substituting 5 = {n/p) 1/{e+2) = (p/n)" 1/(e+2) 
with < e < 1, we get -nT 1 (p/n) k S k = -nT 1 (p/n) ek ^ 2< ' 2+e \ Altogether, we can 
summarize the previous development with the following inequality 

Pr (||G - I n \\ > (n/p) 1/(e+2) ) < Pr (\m k - Em k \ > n" 1 {p/n) ek/2{2+e) + O (1) 

By Markov inequality Pr (\m k — Em k \ > e) < Var (m k ) A 2 - Substituting e = 
n- 1 (p/n) ek/2(2+e) +0(1) we get 

Pr (|m fc - Em k \ > n~ x (p/n) ek/2(2+e) + O (1)) = O (n {n/p) ek/{2+e) ) , 

where in the last equality we used the estimate Var (m k ) = O (n^ 1 ) (see Theorem 

This concludes the proof of the theorem. 

1.7. Proof of Theorem 11.41 Theorem 11.41 follows from Theorem 11.51 using the 
Markov inequality. 

Pr 

In order to show that lim^^co p E = Pgci it is enough to show that for every 
k e N and 5 > we have 

lim Pr(|m fc - m S c,k\ > 5) = 0. 

p — >oo 

The proof of the last assertion proceeds as follows: By the triangle inequality we 
have that \m k - msc,k\ <\m k - Em k \ + \Em k - msc,k\, therefore 

Pr(|m fe - m S c,k\ > 5) < Pr (|m fc - Em k \ + \Em k - m S c,k\ > 6) . 

By (1 1 .3[) there exists Pq S N such that \Em k — msc,k\ < <5/2, for every p > Po, 
hence 

Pr (|m fe - Em k \ + \Em k - m S c,k\ > S) < Pr {\m k - Em k \ >5/2), 
for every p > Pq. Now, using the Markov inequality 

Pr(\ mk - Emk \>S/2)<^0. 

This implies that 

n /i sci ^ r \ / Var(m k ) p ^oo n 

Pr (|m fe - mf| > 6) < — -^-^ -» 0, 

where we use the estimate Var (m k ) = O (1/n) (Equation (|1.4j) ). 
This concludes the proof of the theorem. 



2. Proof of Theorem 11.51 
2.1. Preliminaries on matrix multiplication. 
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2.1.1. Paths. 

Definition 2.1. A path of length k on a set A is a function 7 : [0, fc] — > A. The 
path 7 is called closed if 7 (0) = 7 (fc) . The path 7 is called strict 1/7 (7) 7^ 7 (j + 1) 
/or every j — 0, .., fc — 1. 

Given a path 7 : [0, fc] — > A, an element 7 (j) e A is called a vertex of the path 
7. A pair of consecutive vertices (7 (j) , 7 (j + 1)) is called an edge of the path 7. 

Let Vk (A) denote the set of strict closed paths of length k on the set A and by 
Vk (A, a, b) where a, b G A, the set of strict paths of length k on A which begin at 
the vertex a and end at the vertex 6. 

Conventions: 

• We will consider only strict paths and refer to these simply as paths. 

• When considering a closed path 7 G Vk (A) , it will be sometime convenient 
to think of it as a function 7 : Z/fcZ — > A. 

2.1.2. Graphs associated with paths. Given a path 7, we can associate to it an 
undirected graph G 1 = (V 7 , E 7 ) where the set of vertices V 7 = Im-f and the set of 
edges Ej consists of all sets {a, 6} C A so that either (a, b) or (6, a) is an edge of 7. 

Remark 2.2. Since the graph G 7 is obtained from a path it is connected and, 
moreover, \Vj\ , \E 7 \ < k where k is the length 0/7. 

Definition 2.3. A closed path 7 G Vk {A) is called a tree if the associated graph 
G 7 is a tree and every edge {a, b} G E 7 is crossed exactly twice by 7, once as (a, b) 
and once as (b, a). 

Let 7^ (A) C Vk (A) denote the set of trees of length k. 

Remark 2.4. If 7 is a tree of length k then k must be even, moreover, k = 
2(|V 7 |-1). 

2.1.3. Isomorphism classes of paths. Let us denote by E (A) the permutation group 
Aut (A) . The group E (A) acts on all sets which can be derived functorially from 
the set A, in particular it acts on the set of closed paths Vk (A) as follows: Given 
a G E (A) it sends a path 7 : [0, fc] — ^ A to a o 7. 

An isomorphism class r = [7] G Vk (A) /E (A) can be uniquely specified by a 
k + 1 ordered tuple of positive integers (to, .., Tk) where for each j the vertex 7 (j) 
is the Tjth distinct vertex crossed by 7. For example, the isomorphism class of the 
path 7 = (a, b, c, a, 6, a) is specified by [7] = (1, 2, 3, 1, 2, 1). 

As a consequence we get that 

(2.1) INI = MV T |) = W (14 - 1) ... (\A\ \V y \ + 1) . 

2.1.4. The combinatorics of matrix multiplication. First let us fix some general 
notations: If the set A is [l,n] then we will denote 

• Vk=Vk{[l,n\),Vk{i,j) = V k {[l,n},i,j). 
. T k = T k ([l,n}). 

• £„ = £([l,n]). 

Let M G Mat„xn (C) be a matrix such that ma = 0, for every i G [l,n]. The 
entry mf ^ of the fcth power matrix M k can be described as a sum of contributions 
indexed by strict paths, that is 
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where u> 7 = m 7 ( ). 7 (i) • ti 7 (i). 7 (2) ■ •• • «i 7 (fc-i), 7 (fc) ■ Consequently, we can describe 
the trace of M k as 

(2.2) Tr(M k ) = E w ^= E w n 

2.2. Fundamental estimates. Our goal here is to formulate the fundamental 
estimates that we will require for the proof of theorem 11.51 
Recall 

m fe = n- x Tr (E fe ) = rT 1 (p/nf 2 Tr ((G-7„) fe ) . 

Since (G—I n ) u = for every i 6 [l,n] we can write, using Equation (|2.2p . the 
moment m^ in the form 

(2.3) m fc = n- 1 (p/nf 2 £ w 7 , 

where w 7 : f2„ — ► C is the random variable given by 

w 7 (5) = (5o 7 (0),5o 7 (l)).....{5o 7 (fc-l),5o 7 (fc)). 
Consequently, we get that 

(2.4) Em k = n^ 1 (p/n) k/2 ^ £w 7 . 

Lemma 2.5. Let a G E n £/ien .Ew 7 = £w ff (j. 
For a proof, see Appendix |A1 

Lemma 12.51 implies that the expectation _Bw 7 depends only on the isomorphism 
class [ 7 ] therefore we can write the sum (|2.4p in the form 

Em k = J2 " _1 (p/n) fc/2 |r|Bw T) 

where i?w T denotes the expectation i?w 7 for any 7 € r. Let us denote 
n(r) = n- 1 (p/n) fe/2 |r| = jtT 1 (p/n) fc/2 n^l =p fc / a n l^l- 1 -*/ 2 , 

where in the second equality we used (|2.ip . We conclude the previous development 
with the following formula 

(2.5) Em k = n(T)Ew T . 

Theorem 2.6 (Fundamental estimates). Let r € Vk/^n- 
(1) 7/jfe > 2(|K| - 1) then 

(2.6) lim n (r) £w r = 0. 

p — >oo 



(2) If k < 2 (\V T \ — 1) and r is not a tree then 
7 ) lim n (r) £w r = 0. 

p^oo 

(3) If k < 2 (\V T \ — 1) and t is a tree then 
(2.8) lim n(T)Exv T = 1. 

p— KXD 

For a proof, see Subsection [ 
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2.3. Proof of Theorem ll.5l The proof is a direct consequence of the fundamental 
estimates (Theorem I2.6|) . 

2.3.1. Proof of Equation il.3\) . Our goal is to show that lim^^oo Em k = msc.k- 
Using Equation (|2.5p we can write 



(2.9) lim Em k = V lim n (r) Ew T . 

p — »-oo ' J p— >oo 

rev k /s n 

When k is odd, no class r S Vk/^n is a tree (see Remark 12 .4[) . therefore by 
Theorem l2.6l all the terms in the right side of (|2.9p are equal to zero, which implies 
that in this case lim^^^ Em k = 0. When A; is even then, again by Theorem 12.61 
only terms associated to trees yields a non-zero contribution to the right side of 
therefore in this case 

lim Em.k — lim n (r) £w T = 1 = |7jU . 

p — >oo ^ — ' p — >oo ^ — ' 

rer fc /s„ rer t /s„ 
For every m S N, let n m denote the mth Catalan number, that is 

2m \ 1 



m J m + 1 

On the one hand, the number of isomorphism classes of trees in 7fe/S„ can be 
described in terms of the Catalan numbers: 

Lemma 2.7. If k — 2m, m £ N then 

I ^2m | = K m ■ 

For a proof, see Appendix [SJ 

On the other hand, the moments msc,k of the semicircle distribution are well- 
known and can be described in terms of the Catalan numbers as well: 

Lemma 2.8. If k = 1m then msc,k — otherwise, if k is odd then msc,k = 0. 

Consequently we obtain that for every k G N 

lim Em k = m S c,k- 

p — >oo 

This concludes the proof of the first part of the theorem. 

2.3.2. Proof of Equation By definition, Var (nu.) = Em 2 . — (Emk) 2 ■ 

Equation (|2.3|) implies that 

Em 2 k = n- 2 (p/n) k £ £(w 7l w 72 ), 

Equation (|2.4p implies that 

(Em k f = n~ 2 {p/n) k Ew yi Ew^ 2 

When Vy n Vy = 0, i?(w 7i w 72 ) = £w 7i £w l2 . If we denote by J k C V k X V k 
the set of pairs (71, 72) such that V ll n ^ then we can write 

Var (m fe ) = 7tT 2 (p/nf (£'(w 7l w 72 ) - £'w 7l £ , w 72 ) . 

(7i,7 2 )6 :z: fc 

The estimate of the variance now follows from 
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Lemma 2.9. 

n- 2 (p/n) k Y, l^( w 7 1 w 7 J| = ©(n- 1 ), 

(7i,7 2 )e I fc 

n- 2 (p/n) k l^-Til l- 51 ^! = 0(n- 1 ). 

For a proof, see Appendix IA1 

This concludes the proof of the second part of the theorem. 

2.4. Proof of Theorem 12.61 We begin by introducing notation: Given a set A 
we denote by VL (A) the set of injective maps 

SI (A) — {S : A ^ D} , 

and consider Jl (A) as a probability space equipped with the uniform probability 
measure. 

2.4.1. Proof of Equation t2.6\) . Let t = [7] € Pk/^n be an isomorphism class and 
assume that k > 2 — 1). Our goal is to show that 

lim n(r) |£w T | = 0. 

On the one hand, by Equation (12. lp . we have that |[7]| ~ n'^', therefore 
(2.10) n(r) = (p/n) fe/2 |[ 7 ]| ~ p W n W,\-i-k/2 _ 

On the other hand 

ev, t = \Qn\- 1 y w 7 W = (^)r 1 E w 7 (5) • 

sen„ seQ(v T ) 

By the triangle inequality, |i?w 7 | < |S1 (V^)\ 1 J2sen(V-,) l w 7 (^)l' moreover, by 
the incoherence condition (Equation (JTTTJ) ) 

|w 7 (S)|<//V fc/2 , 

for every 5 € £7 (^y)- In conclusion, we get that |-Ew 7 | < ^ k / 2 p^ k / 2 which combined 
with ([23U]) yields 

n(r)\Ew T \=o(n^- 1 - k / 2 ) P ^0, 

since, by assumption, |V^.| — 1 — fc/2 < 0. 
This concludes the proof of Equation (|2.6[) . 

2.4.2. Proof of Equations {2. 7[ ) and V2.8\) . Let r = [7] € Vk/^n be an isomorphism 
class and assume that k < 2 (\V 7 \ — 1). We prove Equations (|2.7|) . (|2.8p by induction 
on \Vy\. 

Since k < 2 (\Vy\ — 1), there exists a vertex v = 7 (io) where < «o < k — 1, 
which is crossed once by the path 7. Let vi — 7 (io — 1) and v r = 7 (io + 1) be the 
adjacent vertices to v. 

We will deal with the following two cases separately: 

• Case 1. vi 7^ v r . 

• Case 2. vi = v r . 
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Iv U) 



Introduce the following auxiliary constructions: 

If vi v r , let 7 ff : [0, k — 1] — > [l,n] denote the closed path of length k— 1 
defined by 

7 (j) J < *o — 1 

7 (i + 1) *o < i < * - 1 ' 

In words, the path 7- is obtained from 7 by deleting the vertex v and inserting 
an edge connecting vi to v r . 

If = v r , let 7 ff : [0, fe — 2] — * [l,n] denote the closed path of length k — 2 
defined by 

lv[J) 1 70 + 2) k<j<k-2 ■ 

In words, the path 7^ is obtained from 7 by deleting the vertex v and identifying 
the vertices vi and v r . 

In addition, for every u£ — {u, vj, v r }, let 7 U : [0, k] — > [1, n] denote the closed 
path of length defined by 

f 7 CO j < io ~ 1 
7u (j) = < m j = io 

[ 7(i) «o + l<?<fc 
In words, the path 7 U is obtained from 7 by deleting the vertex v and inserting 
an edge connecting vi to it followed by an edge connecting it to w r . 

Important fact: The number of vertices in the paths 7^, 7„ is \Vy \ — 1. 
The main technical statement is the following relation between the expectation 
Ew~/ and the expectations £7w 7 _, Ew 1 . 

Proposition 2.10. 

(2.11) Ew y r^p-i-Ew^ - (p|X|) _1 ^£;w 7ii . 

u 

For a proof, see Appendix lAl 
Analysis of case 1. 

In this case the path 7 is not a tree hence our goal is to show that 

lim n (r) Ew T = 0. 

p — >oo 

The length of 7^ is k — 1 and \ V~/ \ = \ V 7 \ — 1, therefore 

n (r) ~ //2 n l^|-i-fc/2 „ pi/2 n i/2 n ([75?]) _ 

The length of r y u is k and VCy | = \Vy \ — 1, therefore 

n(r) -p^nl^l- 1 -*/ 2 ~n-n([ 7 J). 

Applying the above to (|2.1ip we obtain 

n (r) Ew T ~ (n/p) 1 ' 2 n ([7^]) £ , w 7c . 

By estimate (|2.6p and the induction hypothesis n ([79]) £w 7s = O (1), therefore 
linip^oo n (r) Ev? T = 0, since (n/p) = o (1) (recall that we take n = p 1_e ). 
This concludes the proof of Equation (|2.7[) . 
Analysis of case 2. 

The length of 7^ is k — 2 and \V y \ = \ V y \ — 1, therefore 
n(r)^// 2 nl y -'l- 1 - fc / 2 ^pn([7 5 ]). 
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The length of 7 U is k and | V Ju \ = 
n(r) ~p k/2 n 




Applying the above to (|2. yields 

n (t) Ev,r T ~ n ([7^]) £ , w 7c . 

If 7 is a tree then 7- is also a tree with a smaller number of vertices, therefore, 
by the induction hypothesis linip^oo n ([7^]) Ew Jn ~ 1 which implies by (|2.1ip that 
linip^oo n (r) Ew T = 1 as well. 

This concludes the proof of Equation (|2.8[) . 



3.1. Representation theory. We start with some preliminaries from representa- 
tion theory of the finite Heisenberg group and the associated Weil representation 
(see [17] for a more detailed introduction). 

3.1.1. The Heisenberg group. Let (V,uj) be a two-dimensional symplectic vector 
space over the finite field F p . The reader should think of V as F p x ¥ p with the 
standard symplectic form 



Considering V as an Abelian group, it admits a non-trivial central extension 
called the Heisenberg group. Concretely, the group H can be presented as the set 
H = V x F p with the multiplication given by 



The center of H is Z = Z(H) = {(0, z) : z e F p } . The symplectic group Sp = 
Sp(V,u>), which in this case is just isomorphic to SL2 (F p ), acts by automorphism 
of H through its tautological action on the ^-coordinate, that is, a matrix 



sends an element (v,z), where v = (t,w) to the element (gv,z) where gv — 
(ar + bw, ct + dm). 

3.1.2. The Heisenberg representation . One of the most important attributes of the 
group H is that it admits, principally, a unique irreducible representation. The 
precise statement goes as follows: Let ip : Z — > S 1 be a non-degenerate unitary 
character of the center, for example, in this paper we take i\> (z) — e~ z . It is not 
difficult to show [57] that 

Theorem 3.1 (Stone- von Neuman). There exists a unique (up to isomorphism) 
irreducible unitary representation 7r : H — > U (Ti) with central character ip, that is, 
7r (z) — ip (z) ■ Idji, for every z G Z . 

The representation n which appears in the above theorem will be called the 
Heisenberg representation. 

The representation it : H — > U (Ti.) can be realized as follows: Ti is the Hilbert 
space C(F p ) of complex valued functions on the finite line, with the standard inner 
product 



3. Examples of incoherent dictionaries 



to ((r, w) , (r , w')) — tw — wt . 



(v, z) ■ (v', z') = (v + v', z + z' + v')). 




(f,g) = Y<f( t )9(t), 
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for every f,g£ C(F P ), and the action tt is given by 

• 7r(r,0)[/](t) = /(t + r); 

. tt(0, «>)[/] (x) = ^(wt)f(t); 
. i(z)[/](t)=^( Z )/W, Z eZ. 

Here we are using r to indicate the first coordinate and w; to indicate the second 
coordinate of V ~ F p x F p . 

We will call this explicit realization the standard realization. 

3.1.3. The Weil representation . A direct consequence of Theorem 13. II is the exis- 
tence of a projective unitary representation p : Sp — > PU(TL). The construction of 
/5 out of the Heisenberg representation tt is due to Weil [3U] and it goes as follows: 
Considering the Heisenberg representation tt : H —* U (Ti) and an element g E Sp, 
one can define a new representation tt 9 : H — > U (Ti.) by tt 9 (h) — tt (g (h)). Clearly 
both 7T and it 9 have the same central character -0 hence by Theorem 13.11 they are 
isomorphic. Since the space of intertwining morphisms Hom# (tt, tt 9 ) is one dimen- 
sional, choosing for every g € Sp a non-zero representative ~p(g) € Hom# (-7T, tt 9 ) 
gives the required projective representation. 

In more concrete terms, the projective representation ~p is characterized by the 
Egorov's condition: 

(3.1) p(g)n(h)p(g- 1 ) =TT(g(h)) : 

for every g £ Sp and h £ H. 

The important and non-trivial statement is that the projective representation ~p 
can be linearized in a unique manner into an honest unitary representation: 

Theorem 3.2. There exists a uniqu^ unitary representation 

p:Sp^ U(H), 

such that every operator p(g) satisfies Equation \3. 

For the sake of concreteness, let us give an explicit description (which can be 
directly verified using Equation (|3.1[) ) of the operators p(g), for different elements 
g € Sp, as they appear in the standard realization. The operators will be specified 
up to a unitary scalar. 

• The standard diagonal subgroup A C Sp acts by (normalized) scaling: An 
element 

fa 
a = [o a-i 
acts by 

(3.2) S a [f}(t) = cr(a)f(a-H), 

where a : — » {±1} is the unique non-trivial quadratic character of 
the multiplicative group F£ (also called the Legendre character), given by 

a(a) — aT~(modp). 

• The subgroup of strictly lower diagonal elements U C Sp acts by quadratic 
exponents (chirps): An element 

'1 0^ 
,u 1 . 



^Unique, except in the case the finite field is F3 
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acts by 

M u [f] (t)=i>(-%t 2 )f(t) 

The Weyl element 

' 1 N 
-1 0, 

acts by discrete Fourier transform 



w = 



F[f] («0 = -^ £>(*>*)/(*) 



3.2. The Heisenberg dictionary. The Heisenberg dictionary is a collection of 
p + 1 orthonormal bases, each characterized, roughly, as eigenvectors of a specific 
linear operator. An elegant way to define this dictionary is using the Heisenberg 
representation |20j . 

3.2.1. Bases associated with lines. The Heisenberg group is non-commutative, yet it 
consists of various commutative subgroups which can be easily described as follows: 
Let L C V be a line in V. One can associate tola commutative subgroup Al C H, 
given by Al = {(1,0) ■ I £ L}. It will be convenient to identify the group Al 
with the line L. Restricting the Heisenberg representation it to the commutative 
subgroup L, namely, considering the restricted representation 7r : L — > U (H), one 
obtains a collection of operators {n (I) : I £ L} which commute pairwisely. This, in 
turns, yields an orthogonal decomposition into character spaces 

H=®H X , 

x 

where x runs in the set L of unitary characters of L. 

A more concrete way to specify the above decomposition is by choosing a non- 
zero vector Iq 6 L. After such a choice, the character space H. x naturally corre- 
sponds to the eigenspace of the linear operator 7r (Iq) associated with the eigenvalue 
A = x(/o). 

It is not difficult to verify in this case that 

Lemma 3.3. For every x G £ we have dim7Y x = 1. 

Choosing a vector ip x £ TL X of unit norm k^-JI = 1, for every ^61 which 
appears in the decomposition, we obtain an orthonormal basis which we denote by 
B L - 

Theorem 3.4 ([19, 20 ). For every pair of different lines L, M <zV and for every 
ip £ B L , (j) £ B M 

Since there exist p + 1 different lines in V , we obtain in this manner a collection 
of p + 1 orthonormal bases 

fff = [J B L . 

LCV 

which are /i = 1-coherent. We will call this dictionary, for obvious reasons, the 
Heisenberg dictionary. 
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3.3. The oscillator dictionary. Reflecting back on the Heisenberg dictionary we 
see that it consists of a collection of orthonormal bases characterized in terms of 
commutative families of unitary operators where each such family is associated 
with a commutative subgroup in the Heisenberg group H, via the Heisenberg rep- 
resentation 7r : H — > U (H). In comparison, the oscillator dictionary [16j 02] is 
characterized in terms of commutative families of unitary operators which are as- 
sociated with commutative subgroups in the symplectic group Sp via the Weil 
representation p : Sp — ► U (H) . 

3.3.1. Maximal tori. The commutative subgroups in Sp that we consider are called 
maximal algebraic tori [3] (not to be confused with the notion of a topological torus). 
A maximal (algebraic) torus in Sp is a maximal commutative subgroup which 
becomes diagonalizable over some field extension. The most standard example of 
a maximal algebraic torus is the standard diagonal torus 

Standard linear algebra shows that up to conjugation^ there exist two classes 
of maximal (algebraic) tori in Sp. The first class consists of those tori which are 
diagonalizable already over ¥ p , namely, those are tori T which are conjugated to 
the standard diagonal torus A or more precisely such that there exists an element 
g € Sp so that g -T ■ g^ 1 = A. A torus in this class is called a split torus. 

The second class consists of those tori which become diagonalizable over the 
quadratic extension F p 2, namely, those are tori which are not conjugated to the 
standard diagonal torus A. A torus in this class is called a non-split torus (some- 
times it is called inert torus). 

All split (non-split) tori are conjugated to one another, therefore the number of 
split tori is the number of elements in the coset space Sp/N (see [1] for basics of 
group theory), where N is the normalizcr group of A; we have 

and the number of non-split tori is the number of elements in the coset space Sp/M , 
where M is the normalizer group of some non-split torus; we have 

#(Sp/M)=p(p-l), 

Example of a non-split maximal torus. It might be suggestive to explain further 
the notion of non-split torus by exploring, first, the analogue notion in the more 
familiar setting of the field R. Here, the standard example of a maximal non-split 
torus is the circle group SO (2) C SX2O8.). Indeed, it is a maximal commutative 
subgroup which becomes diagonalizable when considered over the extension field C 
of complex numbers. The above analogy suggests a way to construct examples of 
maximal non-split tori in the finite field setting as well. 

Let us assume for simplicity that —1 does not admit a square root in ¥ p or 
equivalently that p = lmod4. The group Sp acts naturally on the plane V — 



Two elements hi, ha in a group G are called conjugated elements if there exists an element 
g £ G such that g-hfg = h,2- More generally, Two subgroups Hi, Ha C G are called conjugated 
subgroups if there exists an element g G G such that g ■ Hi ■ g~ 1 = Ha. 
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F p x F p . Consider the standard symmetric form BonV given by 

B((x, y), (x', y')) = xx' + yy'. 

An example of maximal non-split torus is the subgroup SO = SO {V,B) C Sp 
consisting of all elements g £ Sp preserving the form B, namely g £ SO if and 
only if B{gu,gv) = B(u,v) for every u, v £ V. In coordinates, SO consists of all 
matrices A £ SL2 (F p ) which satisfy AA l = I. The reader might think of SO as 
the "finite circle" . 

3.3.2. Bases associated with maximal tori. Restricting the Weil representation to a 
maximal torus T C Sp yields an orthogonal decomposition into character spaces 

(3.3) H=®H X , 

x 

where x runs in the set T of unitary characters of the torus T. 

A more concrete way to specify the above decomposition is by choosing a gen- 
eratoJl to £ T, that is, an element such that every t £ T can be written in the 
form t = to, for some n £ N. After such a choice, the character spaces Ti x which 
appears in (|3.3|l naturally corresponds to the eigenspace of the linear operator p (to) 
associated to the eigenvalue A = x (to)- 

The decomposition (|3.3[) depends on the type of T in the following manner (for 
details see [15]): 

• In the case where T is a split torus we have dim7i x = 1 unless x = °~, 
where a : T — > {±1} is the unique non-trivial quadratic character of T (also 
called the Legendre character of T), in the latter case dim7t CT = 2. 

• In the case where T is a non-split torus then dim TL X = 1 for every character 
X which appears in the decomposition, in this case the quadratic character 
a does not appear in the decomposition. 

Choosing for every character x G T 1 x 7^ o~, a vector tp x £ TL X of unit norm, we 
obtain an orthonormal system of vectors Bt = {f x '■ X a } ■ 

Important fact: In the case when T is a non-split torus, the set Bt an 
orthonormal basis. 

Example 3.5. It would be beneficial to describe explicitly the system Ba when 
A ~ G m is the standard diagonal torus. The torus A acts on the Hilbert space H 
by scaling (see Equation li3.2\) ). 

For every x 7^ o~, define a function tp x £ C (F p ) as follows: 



t = 

It is easy to verify that tp x is a character vector with respect to the action p : A — > 
U (H) associated to the character \ ■ a. Concluding, the orthonormal system Ba 
is the set {(p x : x £ G m , x ¥= ""}• 

Theorem 3.6 ([16]). Let <E B Tl and ip £ B T . 2 



maximal torus T in SL2 (F p ) is a cyclic group, thus there exists a generator. 
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Since there exist p (p — 1) distinct non-split tori in Sp, we obtain in this manner 
a collection of p (p — 1) orthonormal bases 

»o = ]J B T . 

TcSp 
non-split 

which are // = 4-coherent. We will call this dictionary the Oscillator dictionary. 
3.4. The extended oscillator dictionary. 

3.4.1. The Jacobi group. Let us denote by J the semi-direct product of groups 

J = Sp ix H. 

The group J will be referred to as the Jacobi group. 

3.4.2. The Heisenberg-Weil representation. The Heisenberg representation n : H — > 
U (H) and the Weil representation p : Sp — > U (Tt) combine to a representation of 
the Jacobi group 

T = p\XTr: J ^>U(H), 
defined by r{g,h) — p(g)Tt(h). The fact that t is indeed a representation is a 
direct consequence of the Egorov's condition - Equation (|3.ip . We will refer to the 
representation r as the Heisenberg-Weil representation. 

3.4.3. maximal tori in the Jacobi group. Given a non-split torus T C Sp, the con- 
jugate subgroup T v = vTv^ 1 C J, for every v € V (the multiplication is in the 
group J), will be called a maximal non-split torus in J. 

It is easy to verify that the subgroups T V ,T U are distinct for v ^ u; moreover, 
for different tori T ^ T' C Sp the subgroups T v , T' u are distinct for every v, u G V. 
This implies that there are p (p — 1) p 2 non-split maximal tori in J. 

3.4.4. Bases associated with maximal tori. Restricting the Heisenberg-Weil repre- 
sentation t to a maximal non-split torus T C J yields a basis Bt consisting of 
character vectors. A way to think of the basis Bt is as follows: If T = T v where T 
is a maximal torus in Sp then the basis Bt v can be derived from the already known 
basis Bt by 

Bt v = 7r (v) Bt, 

namely, the basis -Bt„ consists of the vectors 7T (v) ip where ip. G Bt- 

Interestingly, given any two tori T±,T% C J, the bases Bt x ,Bt 2 remain [i — 4 - 
coherent - this is a direct consequence of the following generalization of Theorem 

EU 

Theorem 3.7 f[16j). Given (not necessarily distinct) tori T\,T% C Sp and a pair 
of distinct vectors ip € Bt x , 4> S Bt 2 

\(<p,ir(v)<l>)\ < -j=, 

for every v S V . 

Since there exist p (p — l)p 2 distinct non-split tori in J, we obtain in this manner 
a collection of p (p — 1) p 2 ~ p A orthonormal bases 

Qeo = \\ Bt. 

TCJ 
non-split 
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which arc jj, = 4-coherent. We will call this dictionary the extended oscillator 
dictionary. 

Remark 3.8. A way to interpret Theorem \3.7\ is to say that any two different 
vectors tp =/= <f> £ 3o are incoherent in a stable sense, that is, their coherency 
is no matter if any one of them undergoes an arbitrary time/phase shift. 

This property seems to be important in communication where a transmitted signal 
may acquire time shift due to asynchronous communication and phase shift due to 
Doppler effect. 

Appendix A. Proof of statements 

A.l. Proof of Lemma 12.51 Let 7 S Vk and a € £„. By definition, a (7) = (707 : 
[0,k] [l,n]. Write 

£w CT(7) = i^r 1 J2 w ^(7) ( s ) ■ 

sen„ 

Direct verification reveals that w CT ( 7 ) (S) = w 7 (o (S)) where a (S) = S o a : 
[1, n] — > ®, hence 

XI Wct W = X W 7( Cr ( 5 ))= X W 7( 5 ') I 

SGf2„ sefi„ sen„ 

which implies that Bw^u) = £w 7 . 

This concludes the proof of the lemma. 

A. 2. Proof of Lemma 12.71 We need to introduce the notion of a Dick word. 

Definition A.l. A Dick work of length 2m is a sequence D = d\d2-..d2m where 
d{ = ±1, which satisfies 

1 

J2 d i > °> 

i=l 

for every I — 1, .., 2m. 

Let us the denote by 2?2m the set of Dick words of length 2m. It is well know 

that |X>2m| = K rn - 

In addition, let us denote by 72 m C T J 2m the subset of trees of length 2m. Our 
goal is to establish a bijection 

Given a tree 7 € Tim define the word D (7) = e?ie?2---c?2m as follows: 

_ J 1 if 7 (i — 1) is crossed for the first time on the i — 1 step 
* \ — 1 otherwise 

The word D (7) is a Dick word since counts the number of vertices 

visited exactly once by 7 in the first / steps, therefore, it is greater or equal to zero. 

On the one direction, if two trees 71,72 are isomorphic then D(pd) = D (72) ■ 
In addition, it is easy to verify that the tree 7 can be reconstructed from the pair 
(D (7) , V 7 ) where V 7 is the set of vertices of 7 equipped with the following linear 
order: 

v < u <^> 7 crosses v for the first time before it crosses u for the first time. 
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This implies that D defines an injection from Ti m jH n into T>2m- 
Conversely, it is easy to verify that for every Dick word D G 2?2m there is a tree 
7 G 'Pik such that D = D (7), which implies that the map D is surjective. 
This concludes the proof of the lemma. 

A. 3. Proof of Lemma 12.91 We begin with an auxiliary construction. Define a 
map U : — > Vik as follows: Given ( 7l , 7 2 ) G 2^, let < i\ < k be the first index 
so that 7j (ii) G y 72 and let < 22 < k be the first index such that r y l — 7 2 (12)- 
Define 

f 7i(i) 0<j<h 
7i u 7 2 C?) = 7 2 t>2 - U + j) «i < j < h + k . 

{ 7i U -k) h + k < j < 2k 

In words, the path 7l U 7 2 is obtained, roughly, by substituting the path 7 2 
instead of the vertex 7l (ii). Clearly, the map U is injective and commutes with the 
action of S„, therefore, we get, in particular, that the number of elements in the 
isomorphism class [71,72] € Ifc/S„ is smaller or equal than the number of elements 
in the isomorphism class [y 1 U 7 2 ] G Vikj^n 

(A.I) |[ 7l , 72 ]| < |[ 7l U 72 ]|. 

First estimate. We need to show 

n- 2 (p/n) k |£( w 7i w 7 2 )l =0{n- 1 ). 

Write 

|^( W Tn. W T 2 )| = X! I [7l > 72] I ' |£( w 7i w 7 2 )| ■ 

(7i,7a)ei* [7i,7 2 ]eife/s„ 
It is enough to show that for every [7^ 7z ] G Zk/^"n 

(A.2) n- 2 (p/n) fe |[ 7l ,7 2 ]l • l#(w 7l w 72 )| = O (n" 1 ) . 

Fix an isomorphism class [71,72] € 2fc/S ra . By Equation (|A.1[) we have that 
I [7i 1 72] I — I [Ti LJ -72] I . In addition, a simple observation reveals that w 7l w 79 
w .... which implies that i?(w 7i w 79 ) = E (Wy U7 J. In conclusion, since the 
length of 71 U 7 2 is 2fc, we get that 

n- 2 (p/n) k |[7i,7 2 ]l ■ |#(w 7l w 7a )| < n" x n ([ 7l U 7 J) |# (w 7l u 7 J | . 

Finally, by Theorem l2.6| we have that n ([71 U 7 2 ]) \E (w 7iU72 ) | = O (1), hence, 
Equation (|A.2[) follows. 

This concludes the proof of the first estimate. 
Second estimate. We need to show 

n- 2 (p/n) k l Sw 7i I l^ w 7 2 1 = O (n" 1 ) . 

(7i.7 2 )e ;r fc 

Write 

\ Ew ii I \ Ew ^\ = H l[7i,7 2 ]l • |^w 7i | |£;w 72 | . 
(71,72)^ [ 7l ,7 2 ]eifc/s„ 
It is enough to show that for every [ 7l , 72 ] G X^/S n 

(A.3) n- 2 (p/n) fe |[7i,7 2 ]|- |^w 7 J |£w 7a | = O (n" 1 ) . 
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Fix an isomorphism class [71,72] € Tfe/E n . By Equation (lA.lj) we have that 
I [7i, 72] I — I [Ti L- 1 72] I- For every path 7, we have that |[7]| = |) ~ w'^ 7 ' (since 
always \ Vj\ < A; and we assume that k is fixed, that is, it does not depend on p), in 
particular 

|[ 7l ]| ~ nKI, 

I [7 2 ] I ~ 
|[ 7l U 72 ]| ~ nK u ^l 

By construction, IV^l^J < l^yj + \V l2 \ — 1 (we assume that V~ fl H V 72 7^ 0), 
therefore, |[7 X U 7 2 ]| =0 (n^ 1 \ | [^2] |) - In conclusion, we get that 

n~ 2 (p/n) |[7i,7 2 ]l ' 1-^7 J |^ w 7 2 | = (n' x n (li) n (12) |^ w 7i| |^ w 7 2 |) > 

where we used the identity rT 2 (p/n) k | [T2] I = n (it) n (72)- Finally, by The- 
orem we have that n (7J \Ew 7i I — 0(1), i — 1,2, hence, Equation (|A.3|) 
follows. 

This concludes the proof of the second estimate and concludes the proof of the 
lemma. 

A. 4. Proof of Proposition I2TT0I Write 

£w 7 = Mvjr 1 J2 w ^) 

sen(v 7 ) 

(A.4) = PiV^r 1 E W 7 (5U6), 

S&a(V y \{v}) b<EV\S(V-,\{v}) 

where S U b : V 1 — > £> is given by 

5 («) u^v 



SUb(u)-- 
Write 

^ w 7 (5 U 6) = 2 w 7 (5 U 6) - ^2 w 7 (5Uft). 

6es\s(K,\{-u}) fees bes{v y \{v}) 

Let us analyze separately the two terms in right side of the above equation. 

First term. 

Write 

E w 7(5u6) = e E w 7(^y- 

Furthermore 

w 7 (S U 6 X ) = (., .) .. (5 (w,) , b x ) (b x , S (v r )) .. (., .) , 
Since B x is an orthonormal basis 

which implies that ^2 w 7 (S \-lb x ) — w 7 „ Concluding, we obtain 

(A.5) ^ w 7 (S LI 6) = |£| w 7o (S) . 

be® 
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Second term. 

Let b S S {Vj\ {v}). Since S is injective, there exists a unique u G V 7 \ {v} such 
that b = S (it), therefore 

w 7 (S U b) = (., .) .. (S («,) , b) (b, S (v r )> .. (., .) = w 7u (S) . 

Furthermore, observe that when u = vi or u = v r we have that w 7 (S) = 
w 7s (S). In conclusion, we obtain 

(A.6) Yj w 7 ( S u 5 ) = 2w 7= ( 5 ) + 51 w -r« ( 5 ) • 

bes(v-,\{v}) uev^\{v,,v r ,v} 

Combining (|A.5|) and (|A.6|) yields 

]T w 7 (Su6) = (|3£l-2)w 7o (5)- ^ w 7u (5). 

b£S\S(K,\{t>}) n6K,\{»,,« r! »} 

Substituting the above in (|A.4p yields 
(A.7) £w 7 = (|X|-2)|n(V 7 )|- 1 £ w 7c (5) 

«£VC,\{B!,e r ,ii} S£n(V 7 \{«}) 

Finally, direct counting argument reveals that 

|0(V 7 )| ~ p|£||Q(V 7 J|, 
|fi(tfy)| ~ p|5E||n(^J|. 

Hence (IA.7I) yields 

£w 7 ~ p _I £w 7ii - ^ (p IXI)" 1 £w 7i _. 

This concludes the proof of the proposition. 
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